AGGCTGTCCCACCCACCATCTGCACCCGCTGCAGCGCCCGCGCCCCTGTCCCGCGCCGTAGTCGTCATTTGTAGCCC 
GCCTGCCGCTCCCGGGGACCCGATCCTACCCTGGGTGCGGGGCAGAGCGGGCATGGCCCGTCTGGGGACCGCCTGCC 
CTGCGCTGGCGCTGGCCCTGGCACTTGTGGCGGTGGCCCTGGCTGGAGTCAGAGCCCAGGGCGCAGCCTTCGAGGAG 
CCTGACTATTACAGCCAGGAGCTCTGGCGGCGCGGGCGCTATTATGGGCATCCGGAGCCTGAGCCGGAGCCGGAGCT 
CTTCTCGCCTTCAATGCATGAAGACCTTAGGGTGGAGGAGCAGGAACAGCAGGAGCCGCACCAGCAGGGCCACAGAA 
CTCCCAAGAAGGCCATCAAGCCCAAGAAGGCTCCCAAGAGGGAGAAGTTAGTTGCAGAGACGCCTCCACCAGGTAAA 
AATAGCAACAGAAAAGGCAGAAGAAGCAAGAATCTTGAGAAAGCTGCCAGTGATGACCATGGTGTCCCTGTGGCTCA 
TGAGGATGTCAGAGAGAGTTGCCCACCTCTTGGTCTGGAAACATTAAAAATCACAGACTTCCAGCTGCATGCCTCCA 
CATCGAAGCGTTATGGCCTGGGAGCCCACCGGGGGAGACTCAACATCCAGGCAGGCATTAATGAAAATGACTTTTAC 
GATGGGGCTTGGTGTGCTGGTAGGAACGACTTGCATCAGTGGATCGAAGTGGATGCCCGGCGCCTGACCAAGTTCAC 
AGGGGTCATTACCCAAGGAAGGAACTCTCTCTGGCTGAGTGACTGGGTGACATCCTATAAAGTCATGGTGAGCAATG 
ACAGCCACACATGGGTTACTGTGAAGAATGGATCTGGCGACATGATATTTGAAGGAAACAGTGAGAAGGAGATTCCT 
GTGCTCAATGAGCTGCCAGTCCCCATGGTGGCCCGCTACATTCGCATAAACCCTCAGTCCTGGTTTGATAACGGGAG 
CATCTGCATGAGGATGGAGATCTTGGGCTGCCCACTGCCGGATCCTAATAACTATTATCACCGACGTAATGAGATGA 

CC AC C AC GG ATGAC CTGGATTTTAAGC AC C AC AAC T ATAAGG AAATGC GC C AGTTGATGAAGGTTGTC AATG AAATG 
TGCCCCAATATTACCAGGATTTACAACATTGGCAAAAGCCACCAGGGCCTGAAATTGTATGCGGTAGAGATCTCTGA 
CCATCCTGGGGAACATGAAGTTGGTGAGCCCGAGTTCCACTACATCGCAGGGGCCCACGGCAATGAGGTTCTGGGAC 
GAGAACTGCTGCTGCTGCTGCTGCACTTCCTCTGCCAGGAATACTCGGCGCAGAACGCACGCATCGTCCGCTTGGTG 
GAGGAGACTCGAATCCACATTCTACCCTCCCTCAATCCTGATGGCTATGAGAAGGCCTATGAAGGAGGTTCCGAGTT 
GGGAGGCTGGTCCCTGGGACGTTGGACCCATGATGGCATCGATATCAACAACAACTTTCCGGATTTAAACTCGCTGC 
TCTGGGAGGCAGAGGACCAGCAGAATGCCCCAAGGAAGGTCCCCAACCACTACATTGCCATCCCTGAGTGGTTTCTG 
TCTGAGAATGCCACAGTGGCCACAGAGACCAGAGCCGTCATCGCCTGGATGGAGAAGATCCCGTTTGTGCTGGGAGG 
CAACCTACAGGGGGGTGAGCTGGTCGTGGCATACCCCTATGACATGGTGCGGTCCCTGTGGAAGACCCAGGAGCACA 
CCCCAACACCTGATGATCATGTGTTCCGCTGGCTGGCGTATTCCTACGCCTCCACTCACCGCCTCATGACAGATGCC 
AGGAGGCGAGTGTGCCACACGGAAGATTTTCAGAAGGAGGAGGGCACCGTCAATGGGGCTTCCTGGCACACAGTGGC 
TGGAAGTCTAAACGATTTCAGCTACCTCCATACAAACTGCTTTGAGCTGTCCATCTACGTGGGCTGTGATAAATACC 
CACACGAGAGCGAGCTGCCGGAGGAATGGGAGAATAACCGGGAGTCTCTGATTGTGTTCATGGAGCAGGTTCATCGA 
GGCATCAAAGGCATAGTGAGAGATTTACAAGGGAAAGGGATTTCAAATGCTGTCATCTCTGTGGAAGGTGTTAACCA 
TGACATCCGGACAGCCAGCGATGGGGATTACTGGCGTCTACTGAACCCTGGCGAATATGTGGTCACAGCCAAGGCGG 
AAGGCTTTATCACTTCCACCAAGAACTGCATGGTTGGCTATGATATGGGAGCTACTCGGTGTGACTTCACCCTCACA 
AAGACCAACCTGGCTAGGATAAGAGAAATTATGGAGACATTTGGGAAGCAGCCTGTCAGCCTACCCTCCAGGCGCCT 
GAAGCTGCGGGGACGGAAAAGGCGGCAGCGTGGGTGACCCTGTCGGACACTTGAGACATACCCCAGACCGTGCAAAT 

AAAAATCCACTCCAGTAGTAAAAAAA 

(SEQ ID N0:1) 



MARLGTACPALALALALVAVALAGVRAQGAAFEEPDYYSQELWRRGRYYGHPEPEPEPELFSPSMHEDLR 

VEEQEQQEPHQQGHRTPKKAIKPKKAPKREKLVAETPPPGKNSNRKGRRSKNLEKAASDDHGVPVAHEDV 

RESCPPLGLETLKITDFQLHASTSKRYGLGAHRGRLNIQAGINENDFYDGAWCAGRNDLHQWIEVDARRL 

TKFTGVITQGRNSLWLSDWVTSYKYWSNDSHTWTVKNGSGDMIFEGNSEKEIPVLNELPVPMVARYIR 

INPQSWFDNGSICMRMEILGCPLPDPl^YYHRRNEMTTTDDLDFKHHISreKEMRQLMKVWEMCPNITRIY 

NIGKSHQGLKLYAVEISDHPGEHEVGEPEFHYIAGAHGNEVLGRELLLLLLHFLCQEYSAQNARIVRLVE 

ETRIHILPSLNPDGYEKAYEGGSELGGWSLGRWTHDGIDINNNFPDLNSLLWEAEDQQNAPRKVPNHYIA 

IPEWFLSENATVATETRAVIAWMEKIPFVLGGNLQGGELWAYPYDMVRSLWKTQEHTPTPDDHVFRWLA 

YSYASTHRLMTDARRRVCHTEDFQKEEGTVNGASWHTVAGSLNDFSYLHTNCFELSIYVGCDKYPHESEL 

PEEWENNRESLIVFMEQVHRGIKGIVRDLQGKGISNAVISVEGVNHDIRTASDGDYWRLLNPGEYWTAK 

AEGFITSTKNCMVGYDMGATRCDFTLTKTNLARIREIMETFGKQPVSLPSRRLKLRGRKRRQRG (SEQ 

ID NO: 2) 



Figure 1 



underlined = deleted in targeting construct 

[ ] = sequence flanking Neo insert in targeting construct 



EAGGCTGTCCCACCCACCATCTGCACCCGCTGCAGCGCCCGCGCCCCTGTCCCGCGCCGT 
AGTCGTCATTTGTAGCCCGCCTGCCGCTCCCGGGGACCCGATCCTACCCTGGGTGCGGGG 
CAGAGCGGGCATGGCCCGTCTGGGGACCGCCTGCCCTGCGCTGGCGCTGGCCCTGGCACT 
TGTGGCGGTGGCCCTGGCTGGAGTCAGAGCCCAGGGCGCAGCCTTCGAGGAGCCTGACTA 
TTACAGCCAGGAGCTCTGGCGGCGCGGGCGCTATTATGGGCATCCGGAGCCTGAGCCGGA 
GCCGGAGCTCTTCTCGCCTTCAATGCA] TGAAGACCTTAGGGTGGAGGAGCAGGAACAGC 
AGGAGCCGCACCAGCAGGGCCACAGAACTCCCAAGAAGGCCATCAAGCCCAAGAAGGCTC 
CCAA [ GAGGGAGAAGTTAGTTGCAGAGACGCCTCCACCAG] GTAAAAATAGCAACAGAAA 
AGGCAGAAGAAGCAAGAATCTTGAGAAAGCTGCCAGTGATGACCATGGTGTCCCTGTGGC 
TCATGAGGATGTCAGAGAGAGTTGCCCACCTCTTGGTCTGGAAACATTAAAAATCACAGA 
CTTCCAGCTGCATGCCTCCACATCGAAGCGTTATGGCCTGGGAGCCCACCGGGGGAGACT 
CAACATCCAGGCAGGCATTAATGAAAATGACTTTTACGATGGGGCTTGGTGTGCTGGTAG 
GAACGACTTGCATCAGTGGATCGAAGTGGATGCCCGGCGCCTGACCAAGTTCACAGGGGT 
CATTACCCAAGGAAGGAACTCTCTCTGGCTGAGTGACTGGGTGACATCCTATAAAGTCAT 
GGTGAGCAATGACAGCCACACATGGGTTACTGTGAAGAATGGATCTGGCGACATGATATT 
TGAAGGAAAC AGTGAGAAGGAGATTC CTGTGCTC AATGAGC TGCCAGTC CCC ATGGTGGC 
CCGCTACATTCGCATAAACCCTCAGTCCTGGTTTGATAACGGGAGCATCTGCATGAGGAT 
GGAGATC TTGGGC TGC C C AC TGC CGGATC CTAATAAC TATTATC AC C GACGTAATGAGAT 
GACC AC C AC GGATGAC C TGGATTTTAAGC AC C AC AAC TATAAGGAAATGCGC C AGTTGAT 
GAAGGTTGTCAATGAAATGTGCCCCAATATTACCAGGATTTACAACATTGGCAAAAGCCA 
CCAGGGCCTGAAATTGTATGCGGTAGAGATCTCTGACCATCCTGGGGAACATGAAGTTGG 
TGAGCCCGAGTTCCACTACATCGCAGGGGCCCACGGCAATGAGGTTCTGGGACGAGAACT 
GC TGC TGC TGC TGCTGC AC TTC C TC TGC C AGGAATAC TC GGC GCAGAAC GC AC GC ATC GT 
CCGCTTGGTGGAGGAGACTCGAATCCACATTCTACCCTCCCTCAATCCTGATGGCTATGA 
GAAGGCCTATGAAGGAGGTTCCGAGTTGGGAGGCTGGTCCCTGGGACGTTGGACCCATGA 
TGGCATCGATATCAACAACAACTTTCCGGATTTAAACTCGCTGCTCTGGGAGGCAGAGGA 
CCAGCAGAATGCCCCAAGGAAGGTCCCCAACCACTACATTGCCATCCCTGAGTGGTTTCT 
GTCTGAGAATGC CACAGTGGC C AC AGAGACCAGAGCCGTCATC GCCTGGATGGAGAAGAT 
CCCGTTTGTGCTGGGAGGCAACCTACAGGGGGGTGAGCTGGTCGTGGCATACCCCTATGA 
CATGGTGCGGTCCCTGTGGAAGACCCAGGAGCACACCCCAACACCTGATGATCATGTGTT 
CCGCTGGCTGGCGTATTCCTACGCCTCCACTCACCGCCTCATGACAGATGCCAGGAGGCG 
AGTGTGCCACACGGAAGATTTTCAGAAGGAGGAGGGCACCGTCAATGGGGCTTCCTGGCA 
CACAGTGGCTGGAAGTCTAAACGATTTCAGCTACCTCCATACAAACTGCTTTGAGCTGTC 
CATCTACGTGGGCTGTGATAAATACCCACACGAGAGCGAGCTGCCGGAGGAATGGGAGAA 
TAACCGGGAGTCTCTGATTGTGTTCATGGAGCAGGTTCATCGAGGCATCAAAGGCATAGT 
GAGAGATTTACAAGGGAAAGGGATTTCAAATGCTGTCATCTCTGTGGAAGGTGTTAACCA 
TGACATCCGGACAGCCAGCGATGGGGATTACTGGCGTCTACTGAACCCTGGCGAATATGT 
GGTCACAGCCAAGGCGGAAGGCTTTATCACTTCCACCAAGAACTGCATGGTTGGCTATGA 
TATGGGAGC TAC TC GGTGTGAC TTC AC C CTC AC AAAGAC C AAC C TGGC TAGGATAAGAGA 
AATTATGGAGACATTTGGGAAGCAGCCTGTCAGCCTACCCTCCAGGCGCCTGAAGCTGCG 
GGGACGGAAAAGGCGGCAGCGTGGGTGACCCTGTCGGACACTTGAGACATACCCCAGACC 
GTGCAAATAAAAATCCACTCCAGTAGTAAAAAAA (SEQ ID N0:1) 



Figure 2A 



Gene Sequence Structure 



327 bp 



Sequence Deleted 422 bp 



Size of full-length 
cDNA: 2490 bp 




Targeting Vector* (genomic sequence) 
Construct Number: 2825 



Arm Length: 
5': 4 kb 
3': 1.5 kb 



Endogenous Locus 



CI Not drawn to scale 



■5*=arrn 



LacZ-Neo 

. Cassette 



3 * &?m 




5 ' >GGCATGGCCCGTCTGGGGACC J 
GCCTGCCCTGCGCTGGCGCTGGCC j 
CTGGCACTTGTGGCGGTGGCCCTG j 
GCTGGAGTCAGAGCCCAGGGCGCA ! 
GC C T TC G AGG AGC C TGAC TAT T AC > 
AGCCAGGAGCTCTGGCGGCGCGGG i 
CGCTATTATGGGCATCCGGAGCCT ; 
GAGC C GGAGC C GGAGC TCTTC TCG 5 
CCTTC AATGCA< 3 ' 
(SEQ ID NO:3) 



5 1 > GAGGGAGAAGTTAGTTGC AGA 
GACGCCTCCACCAGGTAACTTTTG f 
CATC GGGC AGC C C GAGGGGGC GC C j 
AGC GATC GTGGC AC TC C AGGGGAC ; 
ACCTGGCTTCCAGTATGTTTTCTT 
GAGTG AGC C C AGC C AAAGTC C TGT 
GGTGC C TGTGTTATTCCC TAGAGA 
CTACATCTGAGCTAAGTTCAGCTT 
TCTCTCCCTGC<3 1 
(SEQ ID NO: 4} 
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